<!doctype html>
<html lang="en">
 <head>
  <meta charset="utf-8">
  <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
  <title>Reinforcement Learning: Gridworld with Dynamic Programming</title>
  <meta name="description" content="">
  <meta name="author" content="">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">

  <!-- jquery and jqueryui -->
  <script src="external/jquery-2.1.3.min.js"></script>
  
  <!-- bootstrap -->
  <script src="external/bootstrap.min.js"></script>
  <link href="external/bootstrap.min.css" rel="stylesheet">

  <!-- markdown -->
  <script type="text/javascript" src="external/marked.js"></script>
  <script type="text/javascript" src="external/highlight.pack.js"></script>
  <link rel="stylesheet" href="external/highlight_default.css">
  <script>hljs.initHighlightingOnLoad();</script>
  

  <style>
  #wrap {
    width:800px;
    margin-left: auto;
    margin-right: auto;
  }
  </style>

  <script>
  function start() {
    $(".md").each(function(){
      $(this).html(marked($(this).html()));
    });
  }
  </script>
 </head>
 <body onload="start();">

  <a href="https://github.com/qqiang00"><img style="position: absolute; top: 0; right: 0; border: 0;" src="https://camo.githubusercontent.com/e7bbb0521b397edbd5fe43e7f760759336b5e05f/68747470733a2f2f73332e616d617a6f6e6177732e636f6d2f6769746875622f726962626f6e732f666f726b6d655f72696768745f677265656e5f3030373230302e706e67" alt="Fork me on GitHub" data-canonical-src="https://s3.amazonaws.com/github/ribbons/forkme_right_green_007200.png"></a>
   <div id="wrap">
   
    <div id="mynav" style="border-bottom:1px solid #999; padding-bottom: 10px; margin-bottom:50px;">
      <div>
        <img src="loop.svg" style="width:50px;height:50px;float:left;">
        <h1 style="font-size:50px;">Reinforcement Learning <span style="color:#085;">Examples</span></h1>
      </div>
      <ul class="nav nav-pills">
        <li role="presentation" class="active"><a href="index.html">Home</a></li>
        <li role="presentation"><a href="demo_iteration.html">Iteration Demo</a></li>
        <li role="presentation"><a href="car_rent.html">Car Rental Example</a></li>
        <li role="presentation"><a href="gridworld_dp.html">Gridworld: DP</a></li>
        <li role="presentation"><a href="gridworld_td.html">Gridworld: TD</a></li>
        <li role="presentation"><a href="puckworld.html">PuckWorld: DQN</a></li>
        <li role="presentation"><a href="waterworld.html">WaterWorld: DQN</a></li>
      </ul>
    </div>

   <div id="exp" class="md">

# About

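**Reinforcement Learning Examples** includes:

### Solving a simple gridworld with DP

A brief introduction to solving the simple gridworld problem with dynamic programming (DP).

### Solving a simple gridworld with TD learning

Covers SARSA, SARSA(λ), and Q-Learning.

### TD learning on the windy gridworld

Introduces the windy gridworld.

### Policy gradient: area tracking

Introduces the rules of the area-tracking world.

### Policy gradient: catch game

A brief introduction to the rules of the catch game.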



<br><br><br><br><br>
   </div>

   </div>
 </body>
</html>